The Plant Genome
○ Wiley
Preprints posted in the last 30 days, ranked by how well they match The Plant Genome's content profile, based on 53 papers previously published here. The average preprint has a 0.03% match score for this journal, so anything above that is already an above-average fit.
Proma, S.; Garcia-Abadillo, J.; Sagae, V. S.; Sacks, E.; Leakey, A. D. B.; Zhao, H.; Ghimire, B. K.; Lipka, A. E.; Njuguna, J. N.; Yu, C. Y.; Seong, E. S.; Yoo, J. H.; Nagano, H.; Anzoua, K. G.; Yamada, T.; Chebukin, P.; Jin, X.; Clark, L. V.; Petersen, K. K.; Peng, J.; Sabitov, A.; Dzyubenko, E.; Dzyubenko, N.; Glowacka, K.; Nascimento, M.; Campana Nascimento, A. C.; Dwiyanti, M. S.; Bagment, L.; Shaik, A.; Jarquin, D.
Show abstract
Genomic selection holds the potential to serve as a strategic tool to enhance the genetic gain of complex traits in Miscanthus breeding programs. The development of improved cultivars requires their assessment for various traits across diverse environments to ensure suitable overall performance. Hence, the multi-trait multi-environment (MTME) genomic prediction (GP) models offer an opportunity to improve selection accuracy. This study aims to evaluate the potential of five GP models: (1) three MTME models including genotype-by-trait-by-environment interaction (GxExT) and (2) two single-trait multi-environment (STME) models (with and without GxE interaction). A Miscanthus sacchariflorus population comprising 336 genotypes evaluated in three environments and scored for four traits (biomass yield YDY, total culm number TCM, average internode length AIL, and culm node number CNN) was analyzed. The predictive ability of the models was evaluated considering three cross-validation schemes resembling realistic scenarios (CV1: predicting new genotypes, CVP: predicting missing traits in a given environment, and CV2: predicting partially observed genotypes). On average, in all cross-validation schemes compared to the STME the predictive ability of the MTME models was 10% to 70% higher for TCM and AIL. On the other hand, for YDY and CNN, both STME models performed similarly or slightly better (between 5 to 64%) than the MTME models in most environments. While the MTME models were not successful for all traits when compared to their STME counterparts, MTME models improved the prediction of the performance of genotypes that were untested across environments or lacked trait information in a specific environment. Overall, our study suggests that MTME GP models can be implemented in Miscanthus breeding programs to improve the predictive ability of the complex traits, shorten breeding cycles, and accelerate selection decisions.
Proma, S.; Lubanga, N.; Sacks, E.; Leakey, A. D. B.; Zhao, H.; Ghimire, B. K.; Lipka, A. E.; Njuguna, J. N.; Yu, C. Y.; Seong, E. S.; Yoo, J. H.; Nagano, H.; Anzoua, K. G.; Yamada, T.; Chebukin, P.; Jin, X.; Clark, L. V.; Petersen, K. K.; Peng, J.; Sabitov, A.; Dzyubenko, E.; Dzyubenko, N.; Glowacka, K.; Nascimento, M.; Campana Nascimento, A. C.; Dwiyanti, M. S.; Bagment, L.; Shaik, A.; Garcia-Abadillo, J.; Jarquin, D.
Show abstract
Phenotyping high-biomass perennial crops is laborious and the rate of genetic gain in perennial crop breeding programs is typically low. So, it is especially important to identify methods that produce efficiency gains in the breeding process. Miscanthus is a C4 perennial grass with favorable characteristics for producing biomass as a feedstock for biofuels and diverse biobased products. Increasing biomass yield will increase profitability and environmental benefits, so is a key target for Miscanthus breeding. In addition, the identification of well-adapted genotypes across a wide range of environmental conditions requires the establishment of multi-environment trials (METs). Sparse testing is a genomic prediction-based strategy that reduces the phenotyping costs in METs by selecting a subset of genotypes to evaluate in a subset of environments and then predicts the performance of the unobserved genotype-environment combinations. A Miscanthus sacchariflorus (MSA) population comprising 336 genotypes observed across three environments was analyzed. Three prediction models considering main effects (environments, genotypes, genomic) and interaction effects (genotype-by-environment; GxE interaction) were implemented for forecasting dry biomass yield (YDY), total culm (TCM), average internode length (AIL), and culm node number (CNN). Multiple calibration sets based on different compositions and sizes were considered to evaluate performance in terms of the predictive ability (PA) and the mean square error (MSE) for a fixed testing set size. The training set size ranged from 52 to 112 to predict a fixed set of 224 unobserved genotypes across all three environments. The results showed that the model accounting for GxE interaction presented the highest PA and the lowest MSE for CNN (PA: [~]0.77, MSE: [~]0.5) and YDY (PA: [~]0.70, MSE: [~]1.3) while for TCM and AIL these ranged from [~]0.28 to 0.41 and [~]1.3 to 4.3, respectively. Overall, varying training sets and allocation strategies did not affect PA and MSE, with 52 non-overlapping and 0 overlapping genotypes per environment as the optimal cost-effective allocation framework. This suggests that implementing sparse testing designs could significantly reduce phenotyping costs by fivefold, without compromising PA in breeding programs for perennial crops such as Miscanthus.
Blois, L.; Heuclin, B.; Bernard, A.; Denis, M.; Dirlewanger, E.; Foulongne-Oriol, M.; Marullo, P.; Peltier, E.; Quero-Garcia, J.; Marguerit, E.; Gion, J.-M.
Show abstract
Deciphering the genetic architecture of complex quantitative phenotypes remains challenging in quantitative genetics. These traits not only depend of multiple genetic factors but are also established over time and environments. Although quantitative genetics has investigated the genetic determinism of phenotypic plasticity in contrasted environmental conditions, the time related phenotypic plasticity has received less attention. Here we proposed a multivariate Bayesian framework, the Bayesian Varying Coefficient Model, designed for analysing the genetic architecture of the time related phenotypic plasticity by a multilocus approach. We applied the BVCM to time series phenotypes measured at various time scales (daily, monthly, yearly) across a diverse set of biological species. We included in this study: yeast (Saccharomyces cerevisiae), fungi (Fusarium graminearum), eucalyptus (Eucalyptus urophylla x E. grandis), and sweet cherry tree (Prunus avium). The BVCM results were compared with those obtained with a known genome-wide association method carried out time by time. For all species and traits, the BVCM was able to detect the major QTL identified by marker-trait association methods and revealed additional genetic regions of weak effect. It also increased the phenotypic variance explained for most of the phenotypes considered. It revealed dynamic QTLs with transitory, increasing or decreasing effects over time. By considering both the temporal and genetic multivariate structures in a single statistical model, we increased our understanding of the genetic architecture of complex traits notably by reducing the issue of missing heritability. More broadly, this work raises the foundation for extended applications in functional genomics, evolutionary ecology, and crop breeding programs, in which time-related phenotypic plasticity remains crucial for predicting and selecting key quantitative complex traits. Key messageBy capturing the genetic factors influencing the time related phenotypic plasticity, our approach contributes to a deeper understanding of the dynamic nature of genotype-phenotype relationships.
Shaik, A.; Sacks, E.; Leakey, A. D. B.; Zhao, H.; Kjeldsen, J. B.; Jorgensen, U.; Ghimire, B. K.; Lipka, A. E.; Njuguna, J. N.; Yu, C. Y.; Seong, E. S.; Yoo, J. H.; Nagano, H.; Anzoua, K. G.; Yamada, T.; Chebukin, P.; Jin, X.; Clark, L. V.; Petersen, K. K.; Peng, J.; Sabitov, A.; Dzyubenko, E.; Dzyubenko, N.; Glowacka, K.; Nascimento, M.; Campana Nascimento, A. C.; Dwiyanti, M. S.; Bagment, L.; Proma, S.; Garcia-Abadillo, J.; Jarquin, D.
Show abstract
Environmental factors affect crop growth and development thus their consideration across sites and years become essential for genotypic evaluation. Genomic selection (GS) has been broadly implemented to accelerate breeding cycles by skipping field evaluations thus allowing early identification of outperforming genotypes. In this study, 7,740 phenotypic records corresponding to 516 Miscanthus sacchariflorus genotypes evaluated in five locations across three years were considered for analysis. Additionally, environmental data on six weather covariates was implemented to characterize similarities between locations. Different sets of locations of variable sizes were used for model calibration based on two cross-validations (CV00 and CV0) schemes leaving out one location at a time. Predictive ability across locations of the best model varied between 0.45 and 0.90 for both schemes. These results were compared to associate predictive ability in function of weather patterns between training and testing sets to allow models calibration optimization. We found it is feasible to optimize resource allocation by considering environmentally correlated sets. In most cases, the information from only one and, at most, two locations were enough to deliver better results than using all four locations, reducing training sets by up to 75%. The results obtained shed light on helping breeders make informed decisions considering weather data when designing evaluations.
Ahlinder, J.; Waldmann, P.
Show abstract
Current optimum contribution selection (OCS) implementations use point estimates of estimated breeding values (EBVs), potentially leading to suboptimal selections when individuals have uncertain genetic evaluations. We developed a framework assessing how EBV uncertainty affects OCS decisions through MCMC-based approaches using the COSMO optimizer in Julia, evaluated on Norway spruce (Picea abies, n=5,525) and Loblolly pine (Pinus taeda, n=926) populations. Agreement between point estimate (MAP-OCS) and MCMC-OCS was surprisingly low: mean overlap of only 26.6 (4.8) individuals in Norway spruce genotyped subpopulation and 14.1 (3.6) in full pedigree, with Loblolly pine intermediate at 16.0 (9.6). Despite this low individual-level agreement, selection frequency across MCMC iterations corresponded well with EBV rankings (Spearman{rho} = 0.782 for Norway spruce), confirming that higher-EBV individuals were preferentially selected under posterior uncertainty. To comprehensively quantify uncertainty impacts, we employed two complementary metrics: individual robustness scores measuring genetic gain stability upon candidate removal, and population-level contribution distribution metrics capturing concentration of genetic gain across selected individuals. Applying these metrics identified 25 high-risk individuals in Norway spruce and nine in Loblolly pine, and constrained exclusion of these individuals improved individual robustness by 16.5% in Loblolly pine (3.00% genetic gain loss) and 29.8% in Norway spruce (2.14% genetic gain loss). Our uncertainty-aware OCS framework successfully identifies unstable selections that may compromise long-term genetic gain, and we recommend assessing EBV uncertainty through posterior distributions and evaluating population-specific trade-offs when implementing uncertainty-aware selection strategies.
Herrighty, E. M.; Specht, C. D.; Gore, M. A.; Solano, L.; Estrada-Gamboa, J.; Hernandez, C. E.; Tufan, H. A.; Landis, J. B.
Show abstract
Understanding crop genetic diversity is essential for conservation and breeding, yet farmer-maintained germplasm remains largely underrepresented in genomic studies. Theobroma cacao L. has a complex domestication history and extensive global diversity, and cacao currently cultivated in Central America, particularly in Costa Rica, has been understudied compared to South American and Mexican cultivars despite cultural and historical importance. In this study, we investigate the genetic diversity of cacao from farmer-managed systems across Costa Rica to search for Criollo germplasm and identify and characterize any unique local genetic groups. Ninety-four trees were sampled from 17 farms across four regions of the country and sequenced using whole genome resequencing. Farmer materials were analyzed alongside 166 previously characterized reference accessions representing major cacao genetic groups. Population structure analyses, phylogenetic reconstruction, and network approaches revealed that Costa Rican cacao encompasses multiple known genetic groups, including Criollo-derived lineages, while also harboring locally distinct diversity not fully represented in current global reference collections. Analyses revealed close kinship between many accessions with no clear geographic patterns corresponding to the observed population differentiation, reflecting the effects of farmers in creating dominant patterns of gene flow through seed-saving, clonal propagation, and sharing genotypes among farms. Heterozygosity levels varied substantially among individuals, consistent with a mixture of highly inbred Criollo trees and more heterozygous, admixed genotypes. We find that farmer-managed cacao systems are reservoirs of genetic diversity, including possibly rare or historically important lineages, underscoring the value of these farming systems for effective conservation and management of genomic resources for cacao resilience and improvement.
Hodehou, D. A. T.; Diatta, C.; Bodian, S.; Ndour, M.; Sambakhe, D.; Sine, B.; Felderhoff, T.; Diouf, D.; Morris, G. P.; Kane, N. A.; Faye, J. M.
Show abstract
Grain mold severely constrains sorghum [Sorghum bicolor (L.) Moench] productivity and grain quality in subhumid environments. Photoperiod-sensitive flowering plays a key role in mold avoidance and yield stability along north-south rainfall gradients. In response to the high susceptibility of elite cultivars in subhumid zones of Senegal, we developed and characterized a recombinant inbred line (RIL) population derived from Nganda (grain mold-susceptible) and Grinkan (photoperiod-sensitive) varieties. The population was evaluated across three distinct agro-ecological zones over two years. Environmental indices derived from genotype-environmental interactions, together with defined growth windows, strongly influenced flag leaf appearance (FLA), a photoperiodic flowering trait. Plasticity parameters (intercept and slope) for environmental indices, FLA, grain mold severity, and yield enabled identification of loci contributing to flowering response, mold resistance, and yield stability. The maturity gene Ma1 and two QTLs for FLA, qFLA6.2 and qFLA6.3, were identified, stable across environments, and colocalized with grain mold and yield QTLs. The wild-type Ma1 allele from Grinkan delayed FLA and reduced grain mold damage but was not associated with increased yield. The Ma1 effect was confirmed using the developed breeder-friendly KASP marker, Sbv3.1_06_40312464K, in 174 F3 three-way cross families. Photoperiod-sensitive lines with intermediate-to-late FLA alleles showed strong negative associations with mold damage. Overall, the identified stable loci and candidate lines provide foundations for effective molecular breeding of climate-resilient varieties. PLAIN LANGUAGE SUMMARYGrain mold is a fungal disease that reduces sorghum grain yield and quality, particularly in subhumid climates. With the limited number of resistant elite varieties, photoperiod-sensitive flowering to day length variation can contribute to grain mold escape at the end of rainy seasons. We characterized 286 sorghum recombinant inbred lines across three contrasting environments over two years along rainfall gradients in Senegal. Using flag leaf appearance (FLA), which is a photoperiodic flowering trait, strong genotype-environment interactions for FLA and genotypic plasticity were revealed. We identified and validated the common genomic locus associated with FLA variation and its plasticity across environments, the canonical maturity gene Ma1, which was influenced by temperature variation across environments. The presence of Ma1 in the background of photoperiod-sensitive lines enhances grain mold avoidance and yield stability along rainfall gradients in Senegal. CORE IDEASO_LIWe investigated photoperiodic flowering plasticity in sorghum as a contributor to grain mold resistance and yield stability along rainfall gradients. C_LIO_LIThe Maturity locus Ma1 (qFLA6.1) is the major contributor of photoperiodic flowering and its plasticity across semi-arid and subhumid environments. C_LIO_LIHybrid genotypes carrying two stable loci qFLA6.1 and qFLA6.2 sustain high grain mold avoidance in diverse environments. C_LIO_LIPhotoperiod-sensitive lines with medium to late flowering times are effective in avoiding grain mold, while maintaining yield stability in subhumid regions. C_LI
Moslemi, C.; Folgoas, M.; Yu, X.; Jensen, J. D.; Hentrup, S.; Li, T.; Wang, H.; Boelt, B.; Asp, T.; Sibout, R.; Ramstein, G. P.
Show abstract
Computational tools, including biological language models (LMs), show substantial promise in predicting the impact of genetic variants on plant fitness. However, validating variant effect predictions (VEP) requires experimental populations where genetic variation consists of discrete point mutations rather than segregating recombination blocks. In this study, we generated a novel population of Brachypodium distachyon mutant lines to evaluate the accuracy of VEP at single-base resolution. These lines were advanced through single-seed descent for five generations (M1 to M5), with whole-genome sequencing performed at M2 and M5 and phenotypic measurements recorded at M3 and M4. Using state-of-the-art VEP models, we predicted the functional impact of missense protein-coding variants and gene-proximal non-coding variants. We validated these predictions by estimating the effect of mutations on whole-plant measurements (burden tests) and their probability of fixation from M2 to M5 (purging tests). Among missense variants, the protein LM ESM showed superior predictive accuracy compared to the bioinformatic standard SIFT and the genomic LM PlantCAD. Notably, the relationship between VEP scores and allele fixation suggested a log-linear relationship between VEP scores and variant fitness. Among gene-proximal variants, PlantCAD appeared more accurate than supervised models of regulatory activity, such as chromatin accessibility (a2z) and RNA abundance (PhytoExpr). Collectively, our findings highlight the utility of state-of-the-art VEP tools as predictors of fitness and demonstrate the potential of mutant populations to evaluate computational tools for precision breeding applications.
Robles-Zazueta, C. A.; Strack, T.; Schmidt, M.; Callipo, P.; Robinson, H.; Vasudevan, A.; Voss-Fels, K.
Show abstract
Grapevine cluster architecture is a key selection target in breeding programs because it influences disease susceptibility, yield stability and juice quality. High-throughput phenotyping offers a rapid and non-destructive approach to capture biochemical and structural variation in these traits, yet the influence of plant organ reflectance and data partitioning strategies on trait prediction remains poorly understood. In this study, we evaluated how hyperspectral reflectance from different grapevine organs contributes to the prediction of cluster architecture and juice quality traits in two clonal populations of Riesling and Pinot. Using partial least squares regression (PLSR), we assessed the prediction accuracy of eight cluster architecture and six juice quality traits under two data partitioning strategies. Models based on cluster reflectance outperformed those using dry leaf reflectance for most traits, except for pH. Partitioning the dataset by cluster type increased trait variance and improved predictions for number of berries (R{superscript 2} = 0.53), berry diameter (R{superscript 2} = 0.79), and total acidity (R{superscript 2} = 0.48). Visible, red-edge and NIR spectra were most informative regions to predict the traits studied. Together, our results highlight the importance of organ-specific data and appropriate calibration strategies to improve phenomic models for the development of scalable proxies for grapevine improvement. HighlightSpectral phenomics reveals that prediction accuracy in grapevine depends on organ spectral signatures and traits, with cluster reflectance outperforming leaves, informing new phenotyping strategies for breeding improvement.
Monyak, T.; Morris, G.
Show abstract
Global networks of crop breeding programs leverage diverse germplasm, but diversity increases the complexity of maintaining stability in their elite genepools. To characterize genetic heterogeneity in breeding metapopulations and develop insights on how to manage it, we simulated the evolution of breeding populations on fitness landscapes. We revealed the geometric decrease in the average effect size of alleles segregating as standing variation that become fixed along an adaptive walk. We also demonstrated how independent adaptive walks of subpopulations are influenced by genetic drift, leading to cryptic genetic heterogeneity among elite genepools. This variation is released when elite lines derived from independent subpopulations are crossed, leading to segregation for 2-4X more major QTL in admixed families as in unadmixed families, and 2-4X more epistatic interactions. The emergent property of fitness epistasis for traits under stabilizing selection is well-understood in evolutionary genetics, but under-appreciated in crop quantitative genetics. To highlight the importance of this phenomenon, we constructed an empirical genotype-to-fitness landscape from the sorghum NAM, a global admixed prebreeding resource, demonstrating the utility of fitness landscapes for inferring genetic compatibilities within metapopulations. Our findings suggest that in breeding networks, strategies for effective germplasm exchange must account for epistasis in the oligogenic component of the genetic architecture of locally-adapted traits. Article summaryModern public sector crop improvement happens in networks of breeding programs that routinely exchange genetic information. Traditional models for understanding quantitative traits have limited predictiveness in situations with such genetic heterogeneity. This study uses breeding simulations and empirical data to show the utility of the fitness landscape framework for characterizing the genetic architecture of complex traits in breeding metapopulations. By simulating the evolution of breeding programs and integration into networks, it demonstrates how epistatic interactions between large-effect alleles are a fundamental property that must be accounted for when exchanging germplasm. Graphical Abstract O_FIG O_LINKSMALLFIG WIDTH=200 HEIGHT=102 SRC="FIGDIR/small/712732v1_ufig1.gif" ALT="Figure 1"> View larger version (25K): org.highwire.dtl.DTLVardef@1541326org.highwire.dtl.DTLVardef@b553a8org.highwire.dtl.DTLVardef@8758b4org.highwire.dtl.DTLVardef@1d0bdcd_HPS_FORMAT_FIGEXP M_FIG C_FIG
Zhou, W.; Zheng, J.; Zhou, S.; Guo, Y.; Kong, D.; Yang, P.; Zhang, B.
Show abstract
Soluble N-ethylmaleimide-sensitive factor attachment protein receptors (SNAREs) are essential regulators of plant growth, development, and stress adaptation. In this study, we performed a comprehensive genome-wide identification of SNARE genes in cucumber (Cucumis sativus L.), uncovering 51 putative members designated as CsSNAREs. Phylogenetic analysis confirmed that these genes cluster into five major clades: Qa-CsSNARE (14), Qb-CsSNARE (9), Qc-CsSNARE (10), Qb+c-CsSNARE (3), and R-CsSNARE (15). Bioinformatic analysis of their promoter regions, coupled with expression profiling under diverse abiotic stress conditions, highlighted a heightened responsiveness within the Qa-CsSNARE subfamily. To validate this, we selected representative Qa-CsSNARE genes for quantitative real-time PCR analysis under drought and salt stress. Among these, CsSYP121 was notably induced by salt treatment. We subsequently generated transgenic cucumber lines overexpressing CsSYP121 and challenged them with salinity. Phenotypic assessment, combined with measurements of reactive oxygen species (ROS) accumulation and K+/Na+ ratios, demonstrated that CsSYP121 overexpression (OE) confers enhanced salt tolerance and boosts antioxidant capacity. We propose a model wherein CsSYP121 mitigates ROS-induced cellular damage under salt stress, potentially through promoting K+/Na+ homeostasis, thereby improving plant performance under saline conditions. Our findings identify CsSYP121 as a promising candidate gene for breeding salt-tolerant crops.
Kumar, N.; Singh, B. P.; Mishra, P.; Rani, M.; Gurjar, A.; Mishra, A.; Shah, A.; Gadol, N.; Tiwari, S.; Rathor, S.; Sharma, P. C.; Krishnamurthy, S. L.; Takabe, T.; Mitsuya, S.; Kalia, S.; Singh, N. K.; Rai, V.
Show abstract
Salinity and sodicity stresses adversely affect rice growth and yield. To overcome yield losses, suitable tolerant rice cultivars can be developed through a marker-assisted breeding (MAB) program. In the present study, genomic regions associated with sodicity stress tolerance at the reproductive stage were identified using a high-density 50kSNP array in a recombinant inbred line (RIL) population derived from the contrasting rice genotypes CSR11 and MI48. A total of 50 QTLs were detected for various yield-related traits; further, 19 QTLs with [≥]15% of phenotypic variance were selected for integrated (omics) analysis. RNA sequencing of leaves and panicles at the reproductive stage under sodic stress conditions was employed to find differentially expressed genes. A total of 1368 and 1410 SNPs; 104 and 144 indels were found for MI48 and CSR11, respectively, within the QTL regions from resequencing. At chromosomes 1 and 6, colocalized QTLs (qPH1-1/qGP1-1 and qGP6-2/qSSI6-2) were discovered. Differentially expressed genes (DEGs) were mapped over the QTL regions selected, and SNP variations and indels were screened for colocalized QTLs. Potential candidate genes, namely Os-pGlcT1 (Os01g0133400), OsHKT2;1 (Os06g0701600) and OsHKT2;4 (Os06g0701700), OsANTH12 (Os06g0699800), and OsPTR2 (Os06g0706400), were identified as being responsible for glucose transport, ion homeostasis, pollen germination, and nitrogen use efficiency, respectively, under salt stress. Finally, our study provides important insights into the genes and potential mechanisms affecting grain yield under sodic stress in rice, which will contribute to the development of molecular markers for rice breeding programs.
Sattler, M. C.; Singh, A.; Bass, H. W.; Mondin, M.
Show abstract
BackgroundMaize knobs are regions of constitutive heterochromatin that are readily identified in both meiotic and somatic chromosomes. These structures have been characterized as stable throughout the cell cycle, exhibiting late replication during the S-phase, and are composed of two specific families of highly repetitive DNA sequences: K180 and TR-1. Although widely used as cytogenetic markers due to their variability in number and chromosomal position across inbred lines, hybrids, and landraces, little is known about their chromatin structure and dynamics. In this study, we analyzed chromatin accessibility of knobs using DNS-seq data across four maize tissues representing distinct developmental stages. ResultsOur results reveal that K180 knobs exhibit tissue-specific variation in chromatin accessibility, transitioning between open and closed states during development. In contrast, the TR-1 knob of chromosome 4 remained consistently inaccessible across all tissues analyzed. A knob composed of both K180, and TR-1 further supported this observation, with only the K180 region showing dynamic accessibility. To validate these findings, we also analyzed other repetitive regions such as centromeres, which showed a uniformly closed chromatin structure similar to TR-1. These results suggest a unique developmental modulation of chromatin accessibility associated with K180 repeats. While the chromatin accessibility of knobs does not reach the levels observed at Transcription Start Sites (TSS), the comparison among different classes of repetitive DNA within maize constitutive heterochromatin provides compelling evidence for sequence-specific and tissue-specific chromatin dynamics. ConclusionsOur findings uncover a previously unrecognized property of maize knobs and establish a reference for future studies on chromatin organization and epigenetic regulation of repetitive DNA in plant genomes.
Ji, Y.; Chaudhary, R.; Khan, N.; Perumal, S.; Wang, Z.; Moghanloo, L.; Hucl, P.; Biligetu, B.; Sharpe, A. G.; Jin, L.
Show abstract
Concerns over climate change have intensified the demand for stress resistant crops like hybrid wheatgrass (HWG; Elymus hoffmannii, StStStStHH), a perennial forage species known for its exceptional salt and drought tolerance. However, hexaploidy and high heterozygosity have complicated efforts to resolve its genomic structure and evolutionary history. Here, we present high-quality, haplotype-resolved, chromosome-level genome assemblies for HWG (CDC Saltking) and its putative progenitor, bluebunch wheatgrass (Pseudoroegneria spicata). By integrating PacBio HiFi and ultra-long Oxford Nanopore sequencing with Hi-C scaffolding, we assembled the 10.7 Gb HWG genome into 21 pseudochromosomes per haplotype. Our phylogenomic analysis redefines the origin of the H subgenome, positioning it as an intermediate between Old-World Hordeum marinum (sea barley) and Hordeum brevisubulatum. Notably, we identified significant chromosomal rearrangements, including a unique duplication on St chromosome 4. Transcriptome analysis across multiple tissues revealed a pronounced expression dominance of the H subgenome. This dominance was not associated with reduced LTR density, suggesting that selective pressures for rapid adaptation of the latest subgenome entrant may drive its dominance. Finally, using the f-branch statistic, population genomic analysis of 189 accessions representing eight Elymus and Pseudoroegneria species revealed extensive reticulate evolutionary relationships and identified P. spicata as a major, asymmetric genetic donor within the wheatgrass complex. These resources provide a foundational framework for future genomic research and genetic improvement in grasses and for the introgression of stress-tolerance traits into cereal crops such as wheat. Key MessagesDevelopment of world-first high-quality chromosomal-level haplotype-resolved genome assemblies of hexaploid HWG and diploid progenitor, Pseudoroegneria spicata, enabled the identification of the subgenome origins. This study resolved the evolutionary placement of the St genome and clarified the history of polyploidization and hybridization in HWG. Homeolog expression bias in the H subgenome likely reflects selective pressure favoring greater gene retention and upregulation of functionally important genes, thereby enhancing hybrid fitness. Population structure analysis distinctly differentiates P. spicata, E. repens, E. hoffmannii from other European Pseudoroegneria species. The findings reveal the complex patterns of interspecific gene flow and population dynamics within the Elymus and Pseudoroegneria species.
Tressel, L. G.; Caspersen, A. M.; Walling, J. G.; Gao, D.
Show abstract
Barley (Hordeum vulgare L.) is an important crop in the world and its seed dormancy is primarily controlled by a Mitogen-Activated Protein Kinase Kinase 3 (MKK3) gene. Although kinase activity of MKK3 and its roles in barley post-domestication have been widely studied, the pre-domestication evolution of MKK3 and the spread of nondormant alleles among global barley varieties remain largely unexplored. In this study, we analyzed MKK3 sequences in barley and its wild progenitor (H. spontaneum) and identified two polymorphic miniature inverted-repeat transposable elements (MITEs). Comparative analyses indicated that the insertions/excision of the MITEs predated the current estimates of barley domestication. Examination of the barley pangenomes coupled with droplet digital (dd) PCR revealed extensive copy number variation of MKK3 and suggested that transposons likely drove tandem amplification of the MKK3 gene on chromosome 5H. Additionally, approximately 1-Kb MKK3 sequences were found on chromosomes 1H and 6H. Further analysis indicated that these short MKK3 sequences were captured by a CACTA transposon that also contained fragments from four other expressed genes. The acquisition of MKK3 was estimated to be between 1.9-2.5 million years ago. Together, these findings illuminate the dynamic pre-domestication evolution of the MKK3 gene and suggest three independent origins of highly nondormant barley worldwide including a unique lineage predominant in Ethiopian germplasm. This study reveals the pivotal roles of transposons in MKK3 evolution and provide helpful information for understanding the complex history of MKK3 gene in barley and also for improving preharvest sprouting (PSH) tolerant varieties under distinct natural conditions.
Shaik, A.; Sacks, E.; Leakey, A. D. B.; Zhao, H.; Kjeldsen, J. B.; Jorgensen, U.; Ghimire, B. K.; Lipka, A. E.; Njuguna, J. N.; Yu, C. Y.; Seong, E. S.; Yoo, J. H.; Nagano, H.; Anzoua, K. G.; Yamada, T.; Chebukin, P.; Jin, X.; Clark, L. V.; Petersen, K. K.; Peng, J.; Sabitov, A.; Dzyubenko, E.; Dzyubenko, N.; Glowacka, K.; Nascimento, M.; Campana Nascimento, A. C.; Dwiyanti, M. S.; Bagment, L.; Proma, S.; Garcia-Abadillo, J.; Jarquin, D.
Show abstract
Giant Miscanthus giganteus (Mxg) is one of the most promising perennial crops to generate biomass feedstock for bioenergy and biobased products. It is derived from the natural inter-species hybridization of Miscanthus sacchariflorus (Msa) and Miscanthus sinensis (Msi) species, thus population improvement within these species is crucial. Genomic selection (GS) is an attractive option to accelerate breeding of perennial grasses, such as Miscanthus, which requires up to three years of evaluation to produce reliable phenotypic data. Hence, genotypes are observed in multiple years and locations causing inconsistent response patterns from one year to the next, between location, and/or location-by-year combinations. These inconsistencies are known as the genotype-by-environment interaction effect (GxE). Although GS has been successfully implemented in multiple annual crops where straightforward cross-validation schemes exist to assess the levels of predictive ability that can be reached, for perennial crops new cross-validation schemes will help avoid data contamination. Here, we propose a series of cross-validation schemes to evaluate model performance for perennial crops. We perform a case study by analyzing one panel of each species (516 genotypes of Msa, 280 genotypes of Msi) scored for biomass yield at different locations around the world over several years. The results of the different cross-validation schemes provide insights about the usefulness of GS to accelerate the breeding process of Miscanthus species. In addition, leveraging the GxE effects of different types significantly increases predictive ability (up to 10% in Msa and 30% for Msi) compared to the conventional approaches based on main effects only.
Bhalla, H.; Ankita, K.; Ahlawat, A.; Rode, S. S.; Singh, K. H.; Sankaranarayanan, S.
Show abstract
Self-incompatibility (SI), a reproductive mechanism that prevents self-pollen from fertilizing the ovule, is widespread in flowering plants, including the Brassicaceae family, where it promotes outcrossing, genetic diversity, and hybrid vigor. Although prevalent in Brassica rapa, an economically vital crop, it remains poorly characterized in widely grown varieties, such as toria and yellow sarson, with prior studies primarily focused on Brassica napus. Given its potential for hybrid breeding and crop improvement in rapeseed (B. rapa), we characterized key SI-regulatory genes, analyzing their phylogenetic relationships, structure-function dynamics, and expression patterns. Our results indicate sequence, structural, and functional homology as well as conservation with previously known candidates. This study identifies SRK, FER, and ARC1 as essential, while MLPK plays a minor role in SI for the varieties under study. Furthermore, we identified that SRK, FER, and MLPK activate ROS during the SI response, while ARC1 does not. Our findings establish a foundation for harnessing this natural system to integrate agriculturally important traits and sustain them across generations via outcrossing.
Gregoire, M.; Pateyron, S.; Brunaud, V.; Tamby, J. P.; Benghelima, L.; Martin, M.-L.; Girin, T.
Show abstract
AO_SCPLOWBSTRACTC_SCPLOWNitrogen fertilizers are essential for crop productivity but cause environmental harm, necessitating the development of cultivars that thrive under limited nitrogen. This study investigates the transcriptomic response to nitrate in Arabidopsis thaliana (a model dicot), Brachypodium distachyon (a model Pooideae), and Hordeum vulgare (barley, a domesticated Pooideae) to identify conserved and species-specific molecular mechanisms. Using RNA-seq after 1.5 and 3 hours of nitrate treatment, we found that core nitrate-responsive biological processes - such as nitrate transport, assimilation, carbon metabolism, and hormone signaling - are largely conserved across species. However, comparative analysis at gene level based on orthology revealed specificities between the species. For instance, rRNA processing was uniquely stimulated in Arabidopsis, while cysteine biosynthesis from serine and gibberellin biosynthesis were specifically regulated in Brachypodium and barley. Orthologs of key nitrate-responsive genes (e.g., NRT, NLP, TCP20) exhibited variable regulation, reflecting potential adaptations linked to domestication or nutrient acquisition strategies. These findings highlight the importance of integrating model and crop species to uncover targets for improving nitrogen use efficiency in cereals. The study provides a pipeline integrating gene ontology and orthology analyses to compare transcriptomic responses between species.
Djemal, R.; Trabelsi, R.; Ghazala, I.; Ebel, C.; Messerer, M.; Boukouba, R.; Gdoura-Ben Amor, M.; Charfeddine, S.; Elleuch, A.; Gdoura, R.; Mayer, K. F. X.; Winkler, J. W. B.; Schnitzler, J.-P.; Hanin, M.
Show abstract
Drought is a major constraint on the productivity of durum wheat across Mediterranean and North African regions. To elucidate the mechanisms underlying drought resilience, we employed a combination of scenario-controlled phenomics and flag leaf transcriptomics across ten durum wheat genotypes. These included the Tunisian landraces Chili and Mahmoudi, seven breeding lines, and the reference cultivar Svevo. The plants were grown to maturity under well-watered or long-term drought conditions in pots and rhizotrons, enabling a comprehensive assessment of growth, yield components, root architecture, physiological traits, and reaction norm plasticity. Drought markedly reduced performance, yet Chili and Mahmoudi consistently maintained superior biomass, grain number and intrinsic water use efficiency (iWUE). This was supported by balanced C/N allocation, strong osmotic adjustment, and the ability to sustain robust root systems under stress, albeit through partly divergent physiological strategies. Transcriptomic profiling revealed highly genotype specific responses, with drought tolerance unrelated to the number of differentially expressed genes. Instead, the landraces displayed distinct regulatory programs involving mainly photosynthesis protection, ABA-related transporters, osmotic adjustment pathways, and stress-responsive transcription factors. These mechanistic insights identify actionable physiological and molecular determinants of drought plasticity and provide high value targets for accelerating the breeding of climate resilient durum wheat. HighlightsIntegrated phenomics and transcriptomics revealed landrace-specific physiological and molecular mechanisms enabling superior drought resilience and identifying actionable targets for durum wheat improvement.
Salomon, J.; Enjalbert, J.; Flutre, T.
Show abstract
The genetics of interspecific groups remains largely unexplored, despite the central role of social (or indirect) genetic effects in shaping phenotypic expression within communities. Intercropping, i.e. the simultaneous cultivation of multiple crop species in the same field, offers a powerful model to harness these interspecific social effects. Such species mixtures provide well-documented agricultural benefits, yet few breeding frameworks have integrated the genetics of social interactions. Here, we address this gap by extending quantitative genetic theory to interspecific groups, with intercropping as a concrete and applied model case. We propose a quantitative genetic model that jointly analyzes intra and interspecific interactions within a unifying framework. Breeding values are decomposed into a direct component, shared in mono and mixed-crops, an interspecific social component corresponding to the effect of one species on another, and an intraspecific component that captures the social effects within a mono-genotypic stand of cloned plants. Statistically, this consists in simultaneously fitting several linear mixed models, one per stand type, all having direct breeding values in common. As no open-source software can fit such a complex mixed model, we provide such an implementation in R/C++. Simulations across various genetic (co)variance structures and sparse experimental designs showed accurate estimation of all genetic (co)variances and breeding values. With an incomplete, yet balanced design combining sole crops and intercrops, genetic gains in both systems were achievable simultaneously, enabling breeding strategies that progressively integrate intercropping into existing, sole-crop-only schemes. More broadly, this framework allows dissecting direct and social genetic effects when genotypes are observed in mono- and mixed-species situations, cultivated or not.